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It is now well known that gene expression is intricately regulated inside each cell especially 
in mammals. There are multiple layers of gene regulation active inside a cell at a given 
point of time. Gene expression is regulated post-transcriptionally by microRNAs and other 
factors. Mechanistically, microRNAs are known to bind to the 3' UTR of mRNAs and 
cause repression of gene expression and the number of known microRNAs continues 
to increase every day. Dysregulated microRNA signatures in different types of cancer are 
being uncovered consistently implying their importance in cellular homeostasis. However 
when studied in isolation in mouse models, clear-cut cellular and molecular mechanisms 
have been described only for a select few microRNAs. What is the reason behind this 
discrepancy? Are microRNAs small players in gene regulation helping only to fine tune gene 
expression? Or are their roles tissue and cell type-specific with single-cell level effects on 
mRNA expression and microRNA threshold levels? Or does itall come down to the technical 
limitations of high-throughput techniques, resulting in false positive results? In this review, 
we will assess the challenges facing the field and potential avenues for resolving the cellular 
and molecular mechanisms of these small but important regulators of gene expression. 
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BACKGROUND 

microRNAs (miRNAs) are short single stranded RNAs that reg- 
ulate mRNA expression at the post-transcriptional level. The 
very first miRNA:mRNA target pair was discovered in the labs 
of Victor Ambros and Gary Ruvkun {lin-4 and lin-14, respec- 
tively) who were studying the regulation of developmental timing 
in C. elegans (Lee etal, 1993; Wightman etal., 1993). The 
mature miRNA is a single stranded ~22 nucleotide RNA that is 
sequentially processed from a primary transcript (pri-miRNA) 
and the resulting stem cell loop structure (pre-miRNA) by the 
proteins Drosha and Dicer respectively (Kim etal., 2009). The 
basic principle of miRNA mediated regulation- that miRNAs tar- 
get mRNAs for repression by binding to the latter 's 3'UTR- was 
recognized in these early studies, but the pervasiveness of this 
mechanism in gene regulation was not recognized until much 
later. The number of miRNAs in humans and mice is con- 
stantly being revised, and it is likely that much of the genome is 
under the control of miRNA-mediated gene expression regulation 
(Kim etal, 2009). 

Dysregulation of miRNAs has been observed in a number of 
diseases, especially hematopoietic malignancies like chronic lym- 
phocytic leukemia (CLL) and epithelial malignancies such as lung 
cancers and breast carcinomas (Calin and Croce, 2006). However, 
the exact cause-effect mechanism has not been established for 
many miRNAs and diseases despite significant efforts into such 
research. Indeed, while numerous cancer-causing or promoting 
miRNAs have been proposed on the basis of their dysregulation 



in disease states, few have been established as truly oncogenic or 
tumor suppressive. Implicit in this observation is the question 
of whether we should be interpreting the roles of miRNAs in 
disease states in the same way we evaluate the role of proteins 
in oncogenesis, and what molecular and cell biological mecha- 
nisms distinguish miRNAs as novel factors in disease causation 
and progression. In this review, we will evaluate these questions 
critically, using hematologic cancers as an example. We specu- 
late that the paucity of definitive experimental evidence is likely 
attributable to a combination of several factors- including, but 
not limited to, the inherent genetic redundancy of miRNAs, non- 
genetic heterogeneity of cells and our technical limitations in small 
RNA profiling. We begin with a brief primer on the complexity of 
miRNA based regulation and then examine research findings with 
the aim of understanding the limitations of current strategies to 
study miRNAs and exploring alternatives for the future. 

GENERATING COMPLEXITY: HOW THE BIOLOGY OF miRNAs 
IMPACTS INTERPRETATION OF PHEN0TYPES 

miRNA biogenesis and mechanism of action have been exten- 
sively studied and detailed mechanisms are reviewed elsewhere, 
but we review some of the mechanisms whereby complexity in 
gene regulation may be generated by miRNAs (Kim et al., 2009; 
Krol et al., 2010). The gene structure of miRNAs is the first source 
of complexity. Briefly, miRNAs can be encoded within an exon of a 
unique transcript, can occur as "polycistronic" miRNAs, or can be 
found within the introns of protein-coding or non-coding genes. 
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The primary transcript undergoes endoribonucleolytic cleavage 
by Drosha/DGCR8 in the nucleus, and then in the cytoplasm. 
Intronic microRNAs (mirtrons) have also shown to be generated 
by a non-Drosha/DGCR8 mediated, Dicer only dependent mech- 
anism (Castellano and Stebbing, 2013). Only one of the strands 
(the guide strand) thus generated is incorporated into the RNA 
induced silencing complex (RISC). Although the passenger strand 
is usually degraded, there are now reports emerging that show 
an important role for the passenger strand in gene regulation 
(Almeida etal, 2012; Epis etal., 2012; Uchino etal., 2013). The 
two strands of the microRNA are termed 5p and 3p depending 
on their orientation. Recent reports suggest that incorporation 
of either strand into the RISC complex is possible. Examples 
include miR-142 and miR-17 where incorporation of both the 
strands into the RISC complex occurs, leading to the presence of 
both 5p as well as 3p transcripts inside the cell (Kasashima et al., 
2004; Yang etal, 2013). This introduces yet another layer of reg- 
ulation since the 5p and 3p microRNAs would target different 
genes because of the difference in their seed sequence. This is the 
case with miR-28 where overexpression of miR-28-5p and miR- 
28-3p caused different effects in colorectal cancer cells (Almeida 
etal., 2012). Interestingly, miR-17-5p as well as miR-17-3p were 
shown to target the same gene TIMP3 and induced prostate can- 
cer growth (Yang etal, 2013). All of these steps are subject to 
regulation and can therefore create intricate patterns of miRNA 
expression. 

Once the miRNA is incorporated into RISC, it causes repres- 
sion of target gene expression. miRNAs usually target the 3'UTR 
(untranslated region) of the mRNA. Perfect complementarity 
between the miRNA and its target leads to destruction of the 
RNA by the "sheer" activity in the Argonaute-2 protein which 
is part of the RISC (Meister, 2013). This type of repression is 
called RNAi and is very rare in mammals but more commonly 
seen in plants. More commonly, incomplete complementarity 
between the miRNA and its target is seen leading to transla- 
tional repression. Proposed mechanisms include interference of 
the miRNA with the formation of a looped mRNA structure 
required for optimal translation, but further study is needed (Kim 
etal., 2009). Nonetheless, the net effect of miRNA expression 
is thought to be repression of the target gene. The repression 
is thought to be limited in extent (i.e., the gene is partially 
repressed) because miRNAs are induced and degraded over 
time. 

Although binding to the 3'UTR and repression of the gene 
is considered to be the current canonical pathway for miRNA 
mediated gene regulation, non-canonical binding and targeting 
has also been described for microRNAs. Some microRNAs have 
been shown to target the promoters of genes and cause either 
gene activation (miR-373, miR-744, miR-1186) or gene repres- 
sion (miR-320, miR-373). Gene activation was accompanied by 
the presence of active histone marks (H3K4 trimethylation) while 
gene repression was caused by a closed chromatin structure due 
to H3K9 dimethylation (Huang and Li, 2012). Recently, CLASH 
(cross-linking, ligation, and sequencing of hybrids) was used 
along with overexpression of an Agol fusion protein and high- 
throughput sequencing to map the miRNA-interactome. This 
revealed that even though the majority of the miRNA binding 



sites were in the 3'UTRs, microRNAs also bound to 5'UTRs, gene 
bodies, long non-coding RNAs (IncRNAs) as well as introns. There 
was also a class of microRNAs whose mRNA interactions did not 
involve the seed region (miR-92a), or so-called "seedless binding" 
(Helwak etal, 2013). 

The temporal patterns of miRNA expression add signifi- 
cantly to the complexity of the problem. Some miRNAs are 
strongly induced by stimuli such as LPS (miR-155; O'Connell 
etal, 2009) whereas others are expressed at baseline levels and 
seem to maintain fairly constant levels during developmental 
processes (Pauli etal, 2011; Guan etal., 2013). In addition 
to their biogenesis, which can be regulated transcriptionally 
and post-transcriptionally at the level of RNA processing (van 
Kouwenhove et al, 201 1), their degradation can be regulated both 
by proteins, as is the case with Xrn2 (Grosshans and Chatter- 
jee, 2010), and by cell divisions, which dilute out the pool of 
miRNAs if their rate of production is less than the sum of dilu- 
tion and degradation. As one considers disease-related changes 
in miRNA expression, understanding the temporal expression 
pattern, particularly whether phenotypes may be induced by 
various biological stimuli, impacts the interpretation of pheno- 
typic changes. As an example, miR-155-deficient mice showed 
normal baseline levels of B-cells and T-cells, but these mice 
clearly had major defects in immune responses (Thai etal., 
2007; O'Connell etal, 2010b; Huffaker etal., 2012). It is clear 
that miR-155 has major functions when induced, and that 
these functions have importance in the context of oncogene- 
sis. 

Adding to the complexity of miRNA mediated repression is 
computational and experimental evidence that a single miRNA 
can target hundreds or potentially thousands of mRNA molecules. 
The stoichiometry of such inhibition is incompletely understood, 
and it would seem that any two targets could be inhibited to differ- 
ent degrees by a given miRNA. Given computational algorithms 
that are imperfect and the lack of a methodology to predict the 
magnitude of repression, both the identity of and extent of effect 
on target gene expression have to be defined in an individual fash- 
ion by experimentation. Hence, it would be extremely difficult to 
predict the effects of miRNA expression on gene expression at the 
global level. 

Post-transcriptional gene regulation by miRNAs is further 
modulated by RNA binding proteins (RBPs). Numerous RBPs 
have recently been characterized recently and many play a role 
in mRNA stability. A target mRNA can be competitively bound by 
a RBP and a miRNA, leading to a reduction in repression levels 
by the miRNA. RBP binding can also cause a switch in mRNA 
secondary structure, causing steric hindrance to miRNA binding. 
One example is the binding of IGF2BP 1 (Insulin like growth factor 
2 binding protein 1) to the BTRC and MITF mRNAs, preventing 
the binding of microRNAs 183 and 340 respectively. This leads to 
de-repression of these genes and subsequent downstream effects 
(Elcheva et al., 2009; Goswami et al., 2010; van Kouwenhove et al., 
2011). 

Lastly, there is a consideration about miRNA-based regulation 
that seems obvious but has important implications. miRNAs can 
only target transcripts that have been induced in a particular devel- 
opmental or pathological context; they cannot, a priori, cause a 
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gene to be expressed. Hence, the set of genes that a miRNA reg- 
ulates in a given cell is highly dependent on the transcriptional 
program of that particular cell. Finding functional targets in one 
cell type does not necessarily implicate the same targets in a dif- 
ferent cell type. Similarly, signaling pathways implicated in one 
cell type may not be operant in a different cell type. This has 
been highlighted by the germline knockout of miR-146a, which 
demonstrated a phenotype that was almost exclusively restricted 
to the hematopoietic system (Boldin et al., 201 1; Zhao et al., 201 1). 
This has important implications for how one predicts the effects 
of miRNA manipulation across different tissue types.. This also 
emphasizes the fact that microRNA targets will most probably be 
tissue type specific. So microRNA: mRNA interaction studies need 
to be performed specific to cell types and their results interpreted 
according to the tissue of origin. 

miRNAs AND CANCER: PROFILING STUDIES 

After the recognition of miRNAs as being widespread in the 
genome of humans, there was a keen interest to identify dysreg- 
ulated miRNA profiles in a number of diseases, especially cancer. 
Altered miRNA profiles were indeed found in many cancers (Calin 
and Croce, 2006). For example miR-21 was found to be up reg- 
ulated in a number of malignancies like glioblastomas, breast, 
colon and pancreatic cancer and has been anointed as an onco- 
genic miRNA (Garzon etal, 2009). The miR 17-92 cluster was 
up regulated in lymphomas and in breast, lung, colon, stom- 
ach, and pancreatic cancers (Calin and Croce, 2009; Garzon et al., 
2009). MiR- 155 was found to be highly expressed in certain types 
of B-cell lymphomas, including CLL with an aggressive course and 
diffuse large B-cell lymphoma (Calin etal., 2004; Eis etal, 2005; 
Chen etal., 2008). Numerous other miRNAs have also been iden- 
tified in patient-based samples as being dysregulated in disease 
states. 

Similarly, some miRNAs are down regulated in tumors and 
are therefore classified as tumor suppressor miRNAs. The classic 
examples include miR-15a and miR-16-1 loss in CLL and multiple 
myeloma, let-7 miRNA loss in lung and breast cancers (Garzon 
etal., 2009). miR-34 is a p53 responsive miRNA family and its 
members, notably miR-34a, have been observed to be lost in pan- 
creatic, colon, breast and liver cancers, and is a predictor of poor 
prognosis in CLL (Chim etal, 2010; Christoffersen etal, 2010; 
Corney etal, 2010; Hermeking, 2010; Choi etal., 2011; Fabbri 
etal., 2011). In myelodysplastic syndrome, miR-146a was down- 
regulated, and at least part of the time, this occurs as a consequence 
of deletion of the long arm of chromosome 5 in the malignant 
hematopoietic progenitor cells (Starczynowski etal., 2010; Boldin 
etal, 2011; Zhao etal, 2013). 

The last 10 years have seen a proliferation of miRNA profiling 
studies, and while many miRNAs are dysregulated at the level of 
their expression, it is unclear how miRNAs become downregu- 
lated in specific cancers. Until recently, the chromosomal deletion 
of miR-15a/16 (chromosome 12p) and miR-146a (chromosome 
5q) comprised the only two recurrent genetic abnormalities that 
can be traced as a direct cause for miRNA downregulation in the 
hematopoietic system and as noted below, in cancer pathogene- 
sis. Earlier last year, whole genome sequencing of de novo acute 
myeloid leukemia (AML) revealed that miR- 142 undergoes point 



mutations, potentially resulting in loss of targeting by the 3p form 
(Cancer Genome Atlas Research Network, 2013). Further work 
has shown that miR- 142 mutations in AML are localized to the 
seed region of miR-142-3p, leading to reduced expression of miR- 
142-5p. It is an interesting case where the stability of one form of 
the miRNA is dependent on the other form (Trissal etal, 2013). 
Other possible mechanisms include dysregulated miRNA expres- 
sion resulting downstream of mutations in transcription factors. 
Since epigenetic studies of gene expression regulation have largely 
focused on protein-coding genes (where rules for histone modifi- 
cations are better understood), annotations of miRNA promoters 
and enhancers have lagged behind those in the protein-coding 
genes. Therefore, further work in elucidating mechanisms of 
miRNA dysregulation and their integration into large scale genetic 
networks that are dysregulated in cancer is required. 

miRNAs: SMALL PLAYERS IN PATHOGENESIS? 

As expected, altered miRNA profiles have been observed in numer- 
ous malignancies. However, there are only a few malignancies 
where a causal relationship has been directly established for the 
dysregulation of the miRNA and tumor development. A general 
role for the miRNA pathway in oncogenesis was established by 
the observation that mice heterozygous for Dicer developed an 
increased incidence of tumors (Kumar etal, 2009; Ravi etal., 
2012). Hence, global abnormalities in miRNA expression are 
indeed causally linked to an increased incidence of tumors. 

Among the well-known oncomiRs, overexpression of miR-2 1 in 
mice caused pre-B cell leukemia/lymphoma (Medina et al, 2010). 
In this study, miR-21 was expressed in a Nestin-Cre- inducible 
manner, surprisingly producing B-cell tumors. miR- 155 over- 
expression led to a lymphoproliferative disorder in B-cells and 
eventually acute lymphoblastic lymphoma/leukemia (ALL), and 
myeloproliferative disease when overexpressed in myeloid cells 
(Costinean etal, 2006, 2009; O'Connell etal, 2011; Babar etal, 
2012). Recently, overexpression of miR-22 in hematopoietic cells 
has been shown to lead to MDS (Song et al, 2013). Ectopic expres- 
sion of miR-29a (Han etal., 2010) and enforced expression of 
miR- 125b in hematopoietic stem cells (HSCs) led to a myelo- 
proliferative disorder which progressed to AML (O'Connell et al., 
2010a; Chaudhuri et al., 2012). Overexpression of the miR-17-19b 
cluster (five miRNAs) in the bone marrow of Efi-Myc transgenic 
mice led to an acceleration of lymphoma development (He et al., 
2005). Recently, B-cell specific overexpression of the miR 17-92 
cluster led to lymphoma development, establishing its role as an 
independent oncogene (Jin et al, 2013). Other miRNAs which are 
highly expressed in tumors however, have failed to recapitulate the 
full tumor phenotype. For example, overexpression of the puta- 
tive oncogenic miR-25-106b-93 cluster in mice showed no tumor 
development. 

Deletions of tumor suppressor miRNA in mice have to date 
yielded phenotypes with a long latency. Deletion of the tumor 
suppressor miRNA cluster, encoding miR-15a and miR-16-1 led 
to a CLL like disease in mouse models, albeit at around 18 months 
of age and only in ~30% of the involved mice (Aqeilan et al., 2010; 
Klein etal, 2010). Deletion of the miRNA-146a led to a myelo- 
proliferative phenotype and hematologic malignancies, again late 
in life (~18 months; Boldin etal, 2011; Zhao etal, 2011). Mice 
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with targeted deletions of the following putative tumor suppres- 
sor miRNAs did not show development of an overt malignancy: 
miR-145, miR-223, miR-133, and miR-206 (Park etal, 2010). 
Although the overexpression of miR-29 had been shown to reduce 
liver cancer growth in mice (which is contrary to the role that 
it plays in the hematopoietic system), knockout of the miR-29 
locus led to liver fibrosis and not any overt malignancy (Xiong 
etal., 2010; Roderburg etal., 2011). Perhaps the most surprising 
example of a knockout lacking a phenotype is the miR-34a knock- 
out mouse, which did not develop any overt malignancies despite 
being downstream of arguably the most important tumor sup- 
pressor in the human cell, p53. Indeed, the triple knockout of all 
three members of miRNA family did not lead to an increase in 
cancer incidence (Concepcion et al., 2012). This is despite the fact 
that miR-34a targets numerous transcripts for proteins important 
in cell cycle progression, cell cycle progression, and oncogenesis 
(Siemens et al, 2013). 

From a therapeutic standpoint, overexpression and in vivo 
delivery of miRNAs/ modified small nucleic acid molecules has 
met with some success. The overexpression of tumor suppres- 
sor miRNAs have been discovered to be successful in delaying 
tumor development or reducing the tumor burden in mouse 
models. Examples include miR-34a (lung adenocarcinoma), miR- 
145 and miR-33 (colon carcinoma) and miR-15 and 16-1 (colon 
carcinoma). Systemic delivery of the tumor suppressor let- 7 
increased the latency to lung cancer in mouse models (Croce, 
2009; Nana-Sinkam and Croce, 2013).There are also some in vivo 
studies where knock down of oncomiRs has helped in atten- 
uating tumorigenesis. In a mouse model of Ras-drwen lung 
cancer, miR-2 1 inhibition slowed tumor progression (Hatley et al., 
2010). Administration of antagomiRs in mice to block miR-lOb 
and miR-223 resulted in significant antitumor activity against 
breast carcinoma and hepatocellular carcinoma respectively 
(Garzon etal, 2010). 

In essence, not all dysregulated miRNAs discovered by expres- 
sion profiling of tumors are pathogenetic and many do not have a 
clear mechanism of action in oncogenesis. In general, cancer phe- 
notypes driven by overexpression seem to be more severe than 
those generated by loss-of-function, and the latter seem to be 
highly age dependent. These findings may be explained by the 
fact that miRNAs can only act on a "pre-existing" transcriptional 
program. Hence, miRNA gain of function in a cell type could 
lead to aberrant functions by targeting genes that are not normally 
repressed, while loss of function would only affect cells where the 
appropriate target genes and the miRNA of interest are concur- 
rently expressed. This again brings forward the idea that miRNA 
gain and loss-of-function should be studied in the appropriate sys- 
tem and the appropriate biological context. One should also bear 
in mind the fact that microRNA overexpression or knockdown 
phenotypes have little utility in the absence of human disease 
specific dysregulation. Hence animal models and human stud- 
ies should be used together to unravel the mechanism behind 
dysregulated miRNAs. 

miRNA PROFILING: USEFUL BUT NOT FOOLPROOF 

Profiling of tumors provides patient-based data on miRNA expres- 
sion patterns, which is undeniably important, but this presents 



its own set of problems. Centrally, it is very difficult to control 
for variability in collection of tumor cells between patients, as 
opposed to the ease of collecting high-quality controlled samples 
under experimental conditions. While it seems that miRNAs are 
better preserved between samples, as small RNAs demonstrate 
more stability than long RNAs, the techniques used to detect 
miRNAs also engender some variability. The common techniques 
for high-throughput profiling include microarrays, RNA-Seq and 
qPCR arrays. Cost and ease of data analysis remain the two 
main criteria which determine the type of platform utilized. 
The gold standard for individual miRNA detection remains the 
Northern blot and for samples with a low RNA yield (patient 
samples), reverse transcriptase (RT)-qPCR is the technique of 
choice for miRNA quantitation (Pritchard etal, 2012). How- 
ever, interpretation of fold changes obtained from RT-qPCR 
experiments should always consider absolute levels of miRNA 
expression in control cells. For example, the biological impact 
of a 10-fold increase of a low expressed miRNA (1 molecule 
per cell increases to 10 molecules per cell) may be less sig- 
nificant than a 1.5 -fold increase of a highly expressed miRNA 
(50,000 molecules per cell to 75,000 molecules per cell), depend- 
ing on the relative expression levels of their mRNA targets. A 
last consideration in qPCR is the use of appropriate normaliza- 
tion techniques, investigators should consider the use of multiple 
reference genes or mean expression value of all expressed microR- 
NAs in a sample as a normalization factor (Mestdagh etal., 
2009). 

Technically, both the RT step and the amount of RNA can be a 
source of significant bias during RNA profiling. A study compar- 
ing the efficiency of different types of RT enzymes showed around 
a 100-fold difference between enzymes for some genes (Stahlberg 
etal, 2004). This effect was particularly pronounced for mRNAs 
whose expression is low. Introduction of errors due to secondary 
structure, differences in priming efficiency and properties of the 
enzyme itself (like RNAse H activity) can all influence the product 
yield from the RT reaction. Also some miRNAs are GC rich and 
have a higher Tm which is a reason for reduced reverse priming 
efficiency and consequently smaller representation in the cDNA 
library. This effect will be much more pronounced when the quan- 
tity and quality of the RNA is low (for example, paraffin embedded 
samples). Similar to the RT reaction, each reaction in the sample 
preparation workflow may lead to variations including the adap- 
tor ligation step using RNA ligase, linear amplification step as well 
as the fiuorochrome labeling step for microarrays (Stahlberg et al., 
2004; Baker, 2010). 

With so many commercial options available for miRNA profil- 
ing, it is somewhat alarming that different groups have reported 
poor reproducibility across these different platforms. To avoid this 
problem, it has been suggested that once a certain profiling method 
has been standardized in the lab, maintaining the same technique 
is advisable to eliminate bias across methods (Git etal., 2010). 
Earlier quality control studies showed a big variation in the pro- 
filing of miRNAs by high-throughput sequencing especially when 
the quantity of transcripts was not abundant. However, encourag- 
ingly, recent reports on quality control of RNA-seq using newer kits 
suggests that current platforms are reproducible and comparable 
across each other (Adiconis et al., 2013). In addition to the range of 
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platforms, data analysis can differ significantly between studies as 
well as labs, particularly in the stringency of false discovery rates 
and in the numerical level of difference between the samples. This 
too could contribute to the seemingly disparate results obtained 
by different groups. 

Apart from these technical considerations, other factors may 
also lead to difficulty in fully understanding miRNA expression 
data. For example, gene duplications have led to miRNAs origi- 
nating from different chromosomal loci, and each locus is subject 
to unique regulation. At present, we can only quantify the total 
amount of mature miRNA being produced because routine pro- 
filing studies do not distinguish which locus they originated from. 
Moreover, researchers have identified so-called IsomiRNAs, which 
are miRNAs that differ from their cognate miRNA by a few 
nucleotides either at the 3' or even the 5' end. The presence of 
IsomiRNAs could certainly cause a bias in mature miRNA quan- 
titation by these high-throughput methods (Vickers etal., 2013). 
Hence, factors intrinsic to the roles of miRNAs in cells can cause 
variability in data generated by high-throughput studies. 

Several proposals have been made to improve technical issues 
with miRNA based profiling techniques. Some laboratories 
have advocated the use of external reference samples for the 
exact quantitation of miRNAs (Git etal, 2010). But these ref- 
erence samples do not mimic conditions inside the human 
body: most importantly, total RNA samples probably contain 
less than 0.01% miRNA (Git etal, 2010). There is ribosomal 
RNA, tRNA and mRNA that could compete for binding with 
preparative enzymes and thereby could potentially affect results. 
This is being negated to a certain extent now by removing 
the abundant ribosomal RNA before using the RNA for pro- 
filing (Git etal., 2010). It would be ideal if guidelines similar 
to the MIQE guidelines (minimum information for publication 
of quantitative real-time PCR experiments) were made avail- 
able for miRNA profiling techniques (Bustin etal., 2013). This 
would ensure better experimental practice allowing more reliable 
comparative analysis of miRNA profiling data (Witwer, 2013). 
Hence, having a carefully curated database using uniform ana- 
lytic techniques would go a long way toward removing obstacles 
that confound efforts to jump from profiling studies to func- 
tional analyses characterizing dysregulated miRNAs in cancer 
states. 

Apart from high-throughput RNA sequencing, other novel 
techniques are also being employed to delineate targets of 
microRNAs. One of these is cross-linking immunoprecipitation- 
sequencing (CLIP-seq), where small RNAs and mRNAs bound to 
one of the proteins in the RISC, usually Ago- 1, are cross-linked 
by ultraviolet radiation, pulled down using an anti-Agol anti- 
body and then sequenced (Mittal and Zavolan, 2014). RISC-seq 
is a method similar to CLIP-seq using anti-Argonaute antibody 
pull-down followed by reverse transcription and sequencing. The 
difference here is the absence of any cross-linking (Matkovich 
etal., 2011). As mentioned previously, CLASH utilizes a similar 
cross-linking technique but ligates the miRNA and the mRNA 
using RNA ligase. While the specificity of CLASH is high, its sen- 
sitivity is reported to be very low (~2%; Helwak etal, 2013). 
The usage of these techniques in parallel with gene expres- 
sion analyses in the setting of miRNA gain- or loss-of-function 



will likely increase the specificity of the observed microRNA 
targets. 

miRNAs AND REDUNDANCY 

Here, we turn to some possible reasons as to why we see small 
phenotypes in miRNA deficient mice. The first of these relates to 
redundancy of both miRNAs and targeting of particular mRNAs 
by different miRNAs. In addition to there being multiple genomic 
loci producing the same miRNA, many miRNAs are part of miRNA 
"families," which target the same seed sequence. Hence, there is a 
great deal of redundancy at the genomic level. Because the seed 
sequence is small, a single miRNA can target a large number of 
mRNAs (Thomson etal, 2011). The opposite is also true, that a 
single mRNA can be targeted by numerous miRNAs, including 
those from different families. 

There are a number of miRNA target prediction tools avail- 
able online including but not limited to TargetScan, PicTar, and 
Miranda (Ritchie et al., 2013). Confirmation of direct miRNA tar- 
geting of mRNA is usually done by a luciferase reporter-3'UTR of 
the mRNA construct and overexpressing the miRNA. However, not 
all of the predicted mRNA targets are actual targets of the miRNA. 
So what factors decide whether a miRNA will target a particu- 
lar mRNA? There are many factors which likely play a role beyond 
seed matching and binding of complementary RNA strands. These 
may include structural accessibility of the 3'UTR of the mRNA to 
miRNAs, epigenetic modifications in targeted region preventing 
miRNA binding and the presence of miRNA sponges called com- 
peting endogenous RNAs (ceRNAs) which preferentially bind to 
miRNAs thereby de-repressing their mRNA targets (Karreth and 
Pandolfi, 2013). The prediction of miRNA- deficient phenotypes 
suffers from this lack of predictability of target repression. 

Knocking out a single genetic locus encoding a miRNA may not 
be sufficient to fully understand the function of a given miRNA. 
Conceptually, this is similar to the situation of genetic redundancy 
seen with protein-coding genes, but it is different because miRNAs 
are most commonly found as families, and the majority of miRNAs 
share seed sequences with other microRNAs. Another important 
thing to note is the unperturbed levels of the miRNA in a particular 
tissue type. For example, miR-122 has been shown to account for 
nearly 50% of the total hepatocyte miRNA pool. In this situation, 
one could imagine that a knockout would have significant effects 
on gene expression (Szabo and Bala, 2013). 

An ideal loss-of-function study will concurrently target all 
redundant miRNAs for deletion. This requires a large amount 
of resources to generate conventional knockout mice, which can 
be prohibitive to undertake such a project. However, with the 
advent of novel knockdown technologies like ZFNs (zinc fin- 
ger nucleases), CRISPRs (clustered regularly interspaced short 
palindromic repeats) and TALENs (transcription activator-like 
effector nucleases), these complicated knockout animals may now 
be more feasible to create and study (Gaj etal., 2013). Alter- 
native and novel approaches, such as designing ceRNAs that 
can target entire families of miRNAs, should also be consid- 
ered and carefully examined. This approach could provide a 
viable method to knock down a large number of miRNAs at 
the same time. Lastly, the role of specific miRNA-target interac- 
tions should be carefully studied in carefully selected systems. In 
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certain cases and in certain developmental/ pathologic contexts, 
a single miRNA-target pair may be of paramount importance 
(Xiao etal., 2007; O'Connell etal., 2009; Xiao and Rajewsky, 
2009; Rao etal, 2010). Such interactions should be further 
studied using targeted mutations of the miRNA binding site 
in 3'UTRs, as described for miR-155 and AID (Borchert etal., 
2011). 

In most currently utilized techniques, the extent of overexpres- 
sion is usually higher than physiologically attainable limits. Such 
overexpression may overwhelm the miRNA processing machin- 
ery and cause off-target effects (Singh etal, 2011; Karlsen and 
Brinchmann, 2013). Hence, some of the phenotypes that have 
been elaborated with overexpression may be more striking simply 
because of the extent of overexpression. Usage of endogenous pro- 
moters which are weaker than the conventional ones or using an 
inducible promoter which follows linear kinetics would be ideal, 
but this remains technically challenging. Yet, many of these exper- 
iments have revealed the central importance of miRNAs in driving 
oncogenesis. 

NON-GENETIC VARIATION AND THE THRESHOLD EFFECT 

Recent studies have examined the role of variability of gene expres- 
sion in the generation of different cell types during differentiation, 
and presumably similar mechanisms are operant in cancer cau- 
sation and progression. Even when a tissue contains cells of 
the same type, the expression of a particular protein can show 
stochastic cell-to-cell as well as temporal fluctuations within the 
same cell. Sometimes these fluctuations were up to ~1000-fold 
between the low expressing and high expressing cells at one point 



of time (Chang et al., 2008). These differing gene expression levels 
might be purely stochastic or represent phenotypic variants within 
the same population (Mukherji etal, 2011). Recent reports sug- 
gest that such non-genetic heterogeneity might be generated by 
miRNA-mediated effects on gene expression, and that this may 
depend on the number of miRNA binding sites in the 3'UTR, the 
level of mRNA in the cell as well as the level of miRNA in the 
cell. 

It has been demonstrated that at a constant level of miRNA, 
there is a particular threshold of mRNA below which the repres- 
sion by the miRNA is dramatically increased (Mukherji etal., 
2011). On the opposite end of the spectrum, past a certain 
concentration of mRNA, miRNA-mediated repression is virtu- 
ally ineffective. Similarly when the number of binding sites in the 
3'UTR for the miRNA was increased from 1 to 7 in the UTR, the 
repression increased by 40-fold (Chang etal, 2008). This thresh- 
old effect between the miRNA and the mRNA is reminiscent of 
biochemical precipitation reactions involving antigen- antibody 
interactions at the optimum concentration of both the antigen as 
well as the antibody. The precipitin line forms only at this zone 
of equivalence and neither in the region of antigen excess nor in 
antibody excess (Hornbeck, 2001). Hence, there are several rela- 
tionships between the miRNA and mRNA that can profoundly 
change the distribution of gene expression within a population of 
cells. 

It is implied that a cell requires a particular level of mRNA 
and miRNA to coexist for repression to occur. The net effect of 
these threshold effects is that our conventional thinking about 
miRNA- that they partially repress gene expression- may only 
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FIGURE 1 1 Cellular heterogeneity in microRNA levels: Hypothetical cell 
groups (A,B) have the same average number of a particular microRNA per 
cell (average = 4). SNA extraction and qPCR of both these groups of cells 
would give the same microRNA expression in both groups. However, a closer 



look at the two groups shows a bimodal type of microRNA expression pattern in 
groupA and a homogenous expression pattern in group B.This cell to cell 
difference in microRNA levels which might lead to different downstream gene 
expression is often overlooked in the absence of single-cell studies. 
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hold for intermediate concentrations of the miRNA and the target 
mRNA. The implications for experimental science are important: 
we have to be cognizant of the level of the target mRNA in the cells 
that are being targeted for perturbation of the miRNA. In experi- 
ments with miRNA overexpression or knock down, most groups 
usually collect the RNA from an entire population of cells, reveal- 
ing a snapshot of the average gene expression in the cells. This 
certainly doesn't tell us the entire story- different cells with differ- 
ing unperturbed target mRNA levels would have different levels 
of change, depending on the mRNA threshold (Figure 1). There- 
fore, single-cell expression studies are necessary to study the cell to 
cell variation in response to miRNAs. Recent reports have shown 
that miRNA expression can be studied at the single-cell level using 
microfluidics (Wu etal., 2013). This, coupled with lineage trac- 
ing studies may help us understand how phenotypes develop in 
miRNA-deficient or overexpressing cells, both in development and 
in disease. The threshold effect is an important consideration in 
miRNA therapeutics- particularly against cancers. Overexpression 
of miRNAs might be highly efficient at repressing the expression 
of a critical target mRNA in cells that express the target at the opti- 
mum level (Figure 2). However cells with extremely high levels of 
the target mRNA would escape repression, if the above-proposed 
mechanisms are correct, and may even be selected for. In theory, 
this could lead to the elaboration of a more aggressive cancer 
following an initial remission. Hence, targeted therapy utilizing 



small RNA molecules requires careful consideration of ranges of 
target gene expression in the cancer cells. 

CONCLUSION 

The plethora of profiling experiments has to date yielded impor- 
tant data about miRNA expression in primary human disease 
states, but we suggest that these constitute a starting point for 
disease-relevant studies. A major obstacle to understanding these 
miRNAs in vivo is the resource set required to individually target 
these miRNAs and miRNA-target pairs. However, it seems that 
such detailed studies will be required to understand the roles of 
these small RNAs in disease pathogenesis and for their utilization 
as therapeutic interventions. 

We have tried to provide an overview of some of the fac- 
tors that we believe to be obstacles in moving miRNA research 
from the bench to the bedside (or vice-versa). These include 
technical difficulties and variations in high-throughput profil- 
ing studies between labs and across platforms. These could be 
standardized in the future with rigorous quality control, bet- 
ter kits and standard guidelines for sample preparation and 
data analysis. The question of genetic redundancy in miR- 
NAs calls for more complex mouse models which are also 
foreseeable in the future. Non-genetic heterogeneity presents 
a totally different question altogether. Sophisticated single- 
cell expression studies of mRNA, miRNA and proteins are 
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FIGURE 2 | miRNA:mRNA interaction-the threshold effect:The level of 
repression of gene expression is dependent on the levels of both the 
mRNA as well as the microRNA present inside a particular cell. This 
hypothetical figure assumes that the expression of mRNA X is important for 
hematopoietic cell lineage determination with high, intermediate and low 



levels leading to lymphoid, myeloid and erythroid lineage differentiation 
respectively. Gene X is a known target of the microRNA miR-Y. These effects 
would be missed if single-cell measurements of X and miR-Y were not 
performed which reiterates the necessity for newer techniques and analyses 
especially at the single-cell level. 
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required to pave the way to decipher the actual role of 
non-genetic variants present within what many researchers 
have previously presumed to be homogeneous populations. 
With the advent of more robust and reproducible profil- 
ing techniques, more sophisticated genetic manipulations in 
model systems, and single-cell analytical advances, we antici- 
pate a clearer understanding of miRNA roles in biology and 
pathogenesis. 
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